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Abstract 

Reversible computation is one of the most promising emerging tech¬ 
nologies of the future. The usage of reversible circuits in computing de¬ 
vices can lead to a significantly lower power consumption. In this paper 
we study reversible logic circuits consisting of NOT, CNOT and 2-CNOT 
gates. We introduce a set F{n, q) of all transformations ZJ ^ ZJ that 
can be implemented by reversible circuits with (n -|- q) inputs. We de¬ 
fine the Shannon gate complexity function L{n, q) and the depth function 
D{n, q) as functions of n and the number of additional inputs q. First, 
we prove general lower bounds for functions L(n, q) and D{n, q). Second, 
we introduce a new group theory based synthesis algorithm, which can 
produce a circuit & without additional inputs and with the gate com¬ 
plexity 1/(6) < + o(l)) / logj n. Using these bounds, we state 

that almost every reversible circuit with no additional inputs, consisting 
of NOT, CNOT and 2-CNOT gates, implements a transformation from 
F{n, 0) with the gate complexity L(n, 0) x n2" / log 2 n and with the depth 
D(n,0)>2Vl-0(1))/(Slogan). 

Keywords: reversible logic, gate complexity, circuit depth, asymptotic 
bounds. 


1 Introduction 

Reversible logic is essential in quantum computing, but it also has a great po¬ 
tential in designing various computing devices with low power consumption. 
Landauer proved [1] that irreversible computations lead to energy dissipation 
regardless of the underlying technology. Moreover, Bennett showed [2] that 
zero-level of energy loss can be achieved only when a circuit is completely built 
from reversible gates. The main problem is that reversible circuits with fewer 
number of gates [gate complexity) and input count are more practical to use. 
Unfortunately, strict asymptotic bounds for the gate complexity of reversible 
circuits haven’t been found so far. 

Circuit complexity theory goes back to the work of Shannon [3]. He sug¬ 
gested considering a complexity of the minimal switching circuit implementing 
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some Boolean function as a measure of complexity of this function. For to¬ 
day, the asymptotic gate complexity L{n) ^ 2”/n of a Boolean function of n 
variables in a basis of classical gates “NOT, OR, AND” is well-known. 

Reversible computations were discussed by Toffoli in 1980 [4]. He described 
the first reversible gate, 2-CNOT (controlled controlled NOT). After that var¬ 
ious reversible gates (CNOT [5], Fredkin, etc.) were introduced. The subject 
of this paper is reversible logic circuits consisting of NOT, CNOT and 2-CNOT 
gates. A formal definition of these gates from [6] will be used. It is well known 
that any even permutation h G ^(Z^) can be implemented in a circuit with 
n inputs, consisting of NOT, CNOT and 2-CNOT gates [7]. Hence, the gate 
complexity or the depth of this circuit can be considered as a measure of the 
permutation h complexity. 

In this paper we describe a set F{n, q) of all transformations Z 2 —)► Z 2 that 
can be implemented by reversible circuits with (n -|- q) inputs. We estimate 
the gate complexity and the depth of reversible circuit, implementing some 
transformation / £ F{n,q) with q additional inputs (also referred to as an 
additional memory). For this purpose we define the Shannon gate complexity 
function Lin, q) and the depth function D{n, q) as functions of n and the number 
of additional inputs q. 

Using the counting argument, we prove general lower bounds for the func¬ 
tions L(n,q) and D(n,q): 


L{n,q) > 
D{n,q) > 


n2" 


3\og2{n + q) 
n2” 


i{n + q)log 2 in + q) 


( 1 - 0 ( 1 )), 

( 1 - 0 ( 1 )). 


After that we introduce a new group theory based synthesis algorithm, which 
can produce a circuit © without additional inputs and with the gate complexity 
L{&) < 3n2”+^(l-|-o(l)) / log 2 n and the depth D{&) < n2”+®(l-|-o(l)) / log 2 n. 
Finally, using these lower and upper bounds, we formulate the main statement of 
this paper: almost every reversible circuit with no additional inputs, consisting 
of NOT, CNOT and 2-CNOT gates, implements a transformation from F{n,0) 
with the gate complexity L{n,0) x n2"/log2n and with the depth D{n,0) > 
2-(l-o(l))/(31og2n). 


2 Background 

The concept of reversible gates was discussed by Toffoli in 1980 [4]. Gates 
NOT and fe-CNOT and the synthesis of circuits consisting of these gates were 
discussed, for example, in [6]. We will use the following formal definitions of 
NOT and fc-CNOT gates. 

Definition 1. Gate AJ* is a NOT gate with n inputs, which defines the trans¬ 
formation fj : Z 2 ^ Z 2 as follows: 

fjiixi, ■■■ ,Xn)) = {xi, ■ ■ ■ ,Xj - ■ ■ ,Xn) ■ 

Definition 2. Gate CJ) ... = Cf.j, j ^ /, is a generalized Toffoli gate 

(fc-CNOT) with n inputs, k control inputs, which defines the transformation 
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fi-j : Z 2 —>■ ^2 as follows: 


■■■ ,Xn)) = (Xi, • • • ,Xj ©Xii A • • • A • • • ,x„) . 

We will omit an upper index in TV” and C” if the value of n is 

clear from the context. Also we will refer to Nj and as TOF{j) 

and TOF{ii, ■ ■ ■ = TOF{I; j), respectively. It is obvious that in this case 

the equality TOF{j) = TOF[%\j) holds. 

Let’s denote a set of all NOT, CNOT (Feynman) and 2-CNOT (Toffoli) gates 
with n inputs as 

A circuit of gates is usually defined as an acyclic oriented graph with marked 
edges and vertices. In case of reversible circuits of gates from fan-in, fan¬ 
out and random connection of inputs and outputs of gates are forbidden. In an 
oriented graph describing a reversible circuit ©, all the vertices corresponding to 
gates have exactly n numbered inputs and outputs. These vertices are numbered 
from 1 to I and i-th output of m-th vertex, m < I, is connected only to an f-th 
input of (to -|- l)-th vertex. The circuit inputs are the inputs of the first vertex 
and the circuit outputs are the outputs of the T-th vertex. We will also call such 
a connection of gates as composition. 

For every vertex in the graph, i-th input and output are assigned to a symbol 
Xi from some set R = { xi, • • • , }. All symbols can be treated as memory 

registers names (memory cells indices), storing the current computation result 
of the circuit. From definitions (1) and (2) it follows that the value of only one 
memory register can be inverted at a time. This makes an essential difference 
between reversible circuits and irreversible ones. 

Among all the properties of a reversible circuit the most important ones 
for us are the gate complexity and the depth. Let a reversible circuit 6 with 
n inputs be a composition of I gates from © = *j^iTOF{Ij;tj), where 
tj and Ij are the controlled output and the set of control inputs of j-th gate 
respectively. 

Definition 3. The gate complexity L{G) of the reversible circuit © = 
*’j=i TOF{Ij-,tj) is the number of gates 1. 

Classically a circuit’s depth is defined as the length of the longest path from 
an input to an output vertex of the graph, associated with this circuit. In our 
model of a reversible circuit, the associated graph presents itself a single chain, 
so if we use a classical definition of a circuit’s depth, we will get it equal to the 
circuit’s gate complexity. But it is clear that in reality it is not the case. To 
keep our reversible circuit’s model, we introduce an alternative, but equivalent 
definition of a reversible circuit’s depth. 

Definition 4. Reversible circuit © = TOF{Ij\tj) has depth D{&) = 1, if 
for every two of its gates TOF{Ii; ji) and TOF{l 2 ] j 2 ) the following equation 
holds: 

({ti}U/i)n({t2}U/2) = 0 . 

Definition 5. Reversible circuit © has depth D{&) < d, if it can be divided 
into d disjoint sub-circuits with the depth of each equal to 1: 

d 

© = □©'„©', C ©, D(©',) = 1 . (I) 

2 = 1 
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Now we can rigorously define a reversible circuit’s depth. 


Definition 6. The depth D{&) of a reversible circuit © is the minimal number 
of disjoint sub-circuits with the depth of each equal to 1 from the equation (1). 

From the Definition 6 we can derive a simple equation for the depth function 
in case of a reversible circuit © with n inputs: 

L{&) / n < D{e) < L{&) . (2) 


For example, let’s consider a reversible circuit © = Ci -2 * C's;! * N 2 * N 4 * 
Ci,4;2 * N 3 (see Fig. 1). The circuit has six gates, so its gate complexity is 
L{&) = 6. Also, we can divide the circuit into 3 disjoint sub-circuits with the 
depth of each equal to 1: © = (C'i; 2 ) * (C'ap * N 2 * N 4 ) * (C'i 4;2 * N 3 ). So the 
circuit’s depth is D{G) = 3. 
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Figure 1: A reversible circuit © = Ci ;2 * Cs;! * N 2 * N 4 * C'i_ 4;2 * N 3 with the 
gate complexity L{&) = 6 and the depth D{G) = 3. 


From Fig. 1 one can note that our reversible circuit is equivalent to another 
one with the depth equal to 3: ©1 = (C'i;2 * N4) + (Cs;! * N2) * (C'i,4;2 * N3). 
Therefore from here on we will consider, that such circuits © and ©1 are different 
in terms of our reversible circuit’s model, but equivalent in terms of the equality 
of Boolean transformations, defined by them. 


3 Shannon gate complexity, depth and quantum 
weight functions 

It was proved that a reversible circuit with n > 4 inputs defines an even permu¬ 
tation on the set Z 2 [7]. In the same time, it can implement a transformation 
Z™ Z 2 , where m, fc < n, with or without additional inputs. We need the 
following functions to explain this: 

• expanding function (j>n,n+k ■ "^2 defined as 

0n,n-t-fc ((^15 * ‘ , ^n)) — , ‘ * * , Xji^ 0, * • ■ ,0) . 

• reducing function 'i’Z.+k n ■ ^ 2 '*'^ ^2 defined as 

^n-|-fc,n ((^ 1 ! ■ ' ■ I ^n+k)) , ' ' ' , 2:.n-(n) ) j 

where tt is a permutation on the set Zn+k- 

Let us now define a reversible circuit implementing a transformation (see 
Fig. 2). 
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Figure 2: A reversible circuit © implementing a transformation /: Z 2 —>■ Z™ 
with q additional inputs. For every x € Z 2 the equation ,Xn)) = 

(yi, • • • ,ym) holds. 


Definition 7. A reversible circuit &g with (n + q) inputs, defining a trans¬ 
formation g: ^ 2 ’*''^, implements a transformation /: Z 2 ^ Z 2 using 

<7 > 0 additional inputs (additional memory), if there is such a permutation 
TT G S'(Z„+q) that for every x G Z 2 the following equation holds: 

^l+qA9i^^,n+q[^))) = /(x) . 

Note that in this terminology expressions “implements a transformation” 
and “defines a transformation” have different meanings: if a reversible circuit 
&g defines a transformation /, then g(x) = /(x) for all x. If a circuit &g 
implements a transformation /: Z 2 —>■ Z 2 and has exactly n inputs, we will say 
that this circuit implements / without additional inputs. 

Let P 2 {n,n) be the set of all transformations Z 2 ^ Let F{n,q) C 

P 2 {n, n) be the set of all transformations that can be implemented by reversible 
circuits with (n -\- q) inputs. The set of permutations, corresponding to all the 
gates from generates the alternating group A(Z 2 ) and the symmetric group 
S'(Z 2 ) for n > 3 and n < 3 respectively [7]. This implies that F{n, 0) is equal to 
the set of transformations that are defined by all the permutations from A(Z 2 ) 
and from S{1 j2 ) for n > 3 and n < 3 respectively. On the other hand, it is not 
difficult to show that for g > u the equality F"(n, q) = P 2 {n, n) holds. 

Let’s consider a transformation / G F{n,q). Among all reversible circuits, 
consisting of gates from fl^^g and implementing the transformation / with q 
additional inputs, we can find a circuit ©/ with the minimum gate complexity 
and a circuit ©^ with the minimum depth. Let L{f,q) = L{&i) and D{f,q) = 
D{&d). Now we can define the Shannon gate complexity function L(n,q) and 
the depth function D{n,q) as follows: 


L{n,q) 

= , max L{f, q) , 

(3) 

f&F{n,q) 

D{n,q) 

= max D{f, q) . 

(4) 

f&F{n,q) 


If we consider all the gates from 17^ regardless of an underlying technology, 
we can assume that they all have the same technological cost. However, in 
a quantum technology, for example, a technological cost of NOT and CNOT 
gates is much less than a technological cost of a Toffoli gate [8]. Hence, we will 
assume that a gate e from has the weight W (e) depending on the underlying 
technology. 
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We define a quantum weight function W{&) for a reversible circuit G as a 
sum of weights of all its gates. Note that the value of W{&) is not equal to 
the technological cost of a reversible circuit 6, because they may significantly 
differ. But we can state that in most cases a greater value of the function W{&) 
means a greater technological cost of a reversible circuit 6. 

Let’s define the function W{f,q) in a similar way as the functions L(f,q) 
and D{f, q). Then we can define the Shannon quantum weight function W (n, q) 
as follows: 

W{n,q)= max W(/,g). (5) 

feF{n,q) 

Let’s also assume that all NOT and CNOT gates from have the same 
weight and all 2-CNOT gates from have weight If we denote 

the number of NOT and CNOT gates in a reversible circuit G as and 

the number of 2-CNOT gates as then we can derive a simple equality 

for the quantum weight function 

W(G) = . (6) 


Equation (6) means that we should count the number of 2-CNOT gates in a 
reversible circuit separately from the other ones. 

Many reversible logic synthesis algorithms were proposed recently [6, 9, 10, 
11, 12, 13, 14]. For almost every one of them an upper bound for the gate 
complexity of a synthesized circuit is proved. The best known is the upper bound 
L{&) < 5n2” for a reversible circuit G without additional inputs, consisting of 
gates from [13]. We can consider this bound as the best upper bound for 
the function L(n,0): 

L(n,0) < 5n2” . 

Unfortunately, there are no known general lower bounds for the functions 
L{n,q) and D{n,q) for today. In ]7] a lower bound fl(n2"'/ logn) for the func¬ 
tion L(n,0) was proved. In ]15] a lower bound for the gate complexity of a 
reversible circuit without additional inputs, consisting of gates mEXOR, was 
proved. However, the gate complexity of reversible circuits with additional in¬ 
puts was out of the scope. 

The main result of this paper is the following theorems. 


Theorem 1. There is such no € N that for n > uq the following equation holds: 

, 2"(n-2) n 

Lin, q) > —- - - . 

31og2(n-|-g) 3 

The proof of the Theorem 1 will be given in Section 4. 


Theorem 2. There is such no G N that for n > no the following equation holds: 

2"(n- 2) n 

~ 3{n + q)\og 2 in + q) 3{n + q) ' 

Proof. Follows from the Theorem 1 and the equation (2). □ 


Theorem 3. There is such no G N that for n > no the following equation holds: 


W{n, q) > min(lU*'‘"^, ■ f ^ 

\3log2(n + q) 
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Proof. Follows from the Theorem 1 and the equation (6). □ 

Theorem 4. 


L(n,0) ^ 


3n2 


n+4 


log 2 n - log 2 log 2 n - log 2 <^(n) 


(1 + e(n)) 


where (j){n) < n j log 2 n is an arbitrarily slowly growing function and e{n) equals 
to: 


W = 7r7^+(^-«(l) 




Theorem 5. 


D{n,0) ^ 


n2"+5 


log 2 n - log 2 log 2 n - log 2 (fin) 


log 2 n ■ log 2 log 2 n 


(l + e(n)) 


where (fin) < n /log 2 n is an arbitrarily slowly growing function and e(n) equals 

mn) n 


Theorem 6. 


VF(n,0) ^ 


n2"+4 (1T<'^^(1 + ecin)) + + erin)) 


log 2 n - log 2 log 2 n - log 2 (fin) 
where (fin) <n j log 2 n is an arbitrarily slowly growing function and: 

,4 1 A log2log2n 


erin) = (4-o(l)) 


log 2 n ■ log 2 log 2 n 


Proofs of the Theorems 4-6 will be given in Section 5. 

Theorem 7. 

Lin, 0) X n2" / log 2 n . 

Proof. Follows from the Theorems 1 and 4. 


□ 


4 General lower bounds 


As we said earlier, we can implement any permutation h € AiZ^) with a re¬ 
versible circuit without additional inputs, consisting of gates from In pa¬ 
per [16] it was proved that the length L(G', M) of a permutation group G with 
respect to a generating set M has the following lower bound: 


LiG,M) ^ 


l0g|M| \G\ 


In our case we have G = ^(Z^), |G| = (2”)! / 2, |M| = |n^|. Since the cardi¬ 
nality of the set equals to 


= '^in-k) 


n , 


^(1)) 


(7) 
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we can derive a simple lower asymptotic bound for the function L{n, 0): 


L{n, 0) > 


log 2 (( 2 ")!/ 2 ) 

log2(n3/2) 


n2" 

3 log 2 n 


This bound is asymptotically equal to the bound n(n2"/logn) from the pa¬ 
per [7]. 

To derive a general lower bound for the function L(n, g), we should take into 
account all transformations Z 2 —?> Z 2 that can be implemented by a reversible 
circuit with (n+q) inputs. There are no more than P{n+q, n) (an n-permutation 
of (n + q)) of such transformations. 

Let’s now proceed to the proof of the Theorem 1. 


Proof of the Theorem 1. We use counting argument to prove that there is such 
no G N that for n > the following equation holds: 


L{n,q) > 


2"(n-2) 

31og2(n-kg) 


n 
3 ■ 


Let r = |r2^|. From the equation (7) it follows that 


2 

E 

fc =0 


(n — k) 


n^(n — 1) 


-I- 1 < r < 


rt' — n? + 2n 


if n > 2 . 


Let C*(n, s) and C(n, s) be the number of all reversible circuits of gates from 
with the gate complexity s and no more than s respectively. Then the 
following equations hold: 


® j.s+1 _ 1 / t ? 3 \ 

e(n, s) = E = r_i ^ ( Y ) 

e(n.»)s (y) ■(' + ^Y) -if’•>2. 


2 

n^(n — 1) 


As we said earlier, there are no more than P(n -I- 9 ,n) of different transfor¬ 
mations Z 2 Z 2 , that can be implemented by a reversible circuit with (n -|- q) 
inputs. Hence, we can state that 


Q{n + q,L{n,q))-P{n + q,n) > |F(n,(?)| . 

Since \F{n, g)| > |A(Z 2 )| = (2”)! / 2 and P(n + q,n) < {n + g)", it follows that 


(n -K qf 


3\ L{n,q) 


1-t 


n + q — \ 


{n + qT > ( 2”)!/2 . 


There is such no G N that for n > no an equation (2")! > (2" / e)^ holds. 
For such values of n we can state that 


1 -)- 


L(n, q) ■ (3 log 2 (n + q) - 1) + log 2 


n + q — 1J 

+ n\og2{n + q) >2"(n-log2e) . 













From this we obtain a general lower bound for the function L{n, q): 



□ 


In the following section we will give a description of a new group theory based 
synthesis algorithm, which can produce a reversible circuit with asymptotically 
the best gate complexity and without additional inputs. 

5 Synthesis of circuits without additional inputs 

A reversible circuit without additional inputs, consisting of gates from can 
implement only an even permutation. In [6] a group theory based synthesis 
algorithm was described. This algorithm for any permutation h G ^(Z^) can 
produce a circuit © implementing h with the gate complexity L{&) < 7n2". 

Let us now describe a new synthesis algorithm which use a similar technique 
as the algorithm from [6], but has a better upper bound for the gate complexity 
of a synthesized circuit. This algorithm’s description will be given in a form of 
the theorem proof. 

Proof of the Theorem f. We will describe a new group theory based synthesis 
algorithm, which for any permutation h G ^(Z^) can produce a circuit © im¬ 
plementing h with the gate complexity 




L(©)< 



where 4 >{n) < n j log 2 n is an arbitrarily slowly growing function and the function 
e(n) equals to 



Let’s consider a permutation h G ^(Z^) and the transformation //j: Z 2 —>■ 
Z 2 , defined by it. The main idea is in a decomposition of h into the product of 
transpositions in such a way that all of them can be grouped by K independent 
transpositions: 


h = Gi o G 2 o ■ ■ ■ o Gt o h' , 


( 8 ) 


where Gi = (xi^i,yi^i) o ••• o {yii^K,yi,K) is an i-th group of K independent 
transpositions, G Z 2 and h' is a residual permutation. Our goal is to 

show that a group Gi can be implemented by a composition of a single multiple 
controls Toffoli gate and many CNOT and 2-CNOT gates. 

The permutation h can also be represented as the product of independent 
cycles with the sum of cycles lengths no more than 2". Having this permutation 
representation, we can obtain independent transpositions from the cycles as 
follows: 

(* 1 , • • • Or) O (jl, • • • ,js) = (HO2) O (jl,j 2 )o 

0r)o(ji,j3,--- ,js) ■ (9) 
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(ii, • • • ,v) = (*i,i 2 ) o (i3,i4) o (*i,i3,*5,*6, • • • ,*r) for r>5. (10) 

If we look at the permutation representation (8) and the equations (9)-(10), 
we will see that K independent transpositions can’t be obtained from a residual 
permutation h' only if it has less than K independent cycles and every of these 
cycles has the length less than 5. Hence, a sum of the cycles lengths of the 
permutation is no more than 4:{K — 1). 

Let Mg be the set of non-fixed points of a permutation g G 5'(Z2): 

Mg = {x G I g{x) . 

Then we can state that \Mh\ < 2", \Mhi \ < 4,{K — 1). 

If we apply the equation (8) to the permutation h' providing that K = 2, 
we will see that this permutation can be represented as a product of no more 
than \Mh'\ / 2 independent transpositions pairs and one cycle with the length 3 
at most. Every cycle with the length 3 also can be represented as a product of 
two independent transpositions pairs: 


(bj)*) = ((b j) o (bs)) o ((r,s) o (i,fc)) . 

Let be a permutation which is represented as a product of i indepen¬ 
dent transpositions and fg(i) be the transformation defined by this permutation. 
Now we can derive an upper bound for the function L{fh,Q), where fh is the 
transformation, defined by a permutation h G ^(Z^): 

Hh, 0) < ^ • L(/gW, 0) + + 2^ ■ L(/g(.), 0) , 

2^ 

Lifh, 0) < — • L(/g(K) ,0) + 2K- L{fg,., , 0) . (11) 

All we should do now is to find out an upper bound for the function L{fg(K) , 0). 

Let’s consider an arbitrary permutation g^^^ G ^(Z^). Let k be the cardi¬ 
nality of the set Mg(K): k = |Mg(ii:) |, then k = 2K. The essence of the proposed 
synthesis algorithm is in a conjugation of the permutation g^^^ in order to get 
a permutation corresponding to a single fc-CNOT gate. Every gate e from 
defines a permutation he, for which h~^ = he- This means that conjugating 
gi^) by he corresponds to the attaching the gate e to the front and back of a 
current sub-circuit. 

Let g^^^ = (xi, yi) o • • • o {xk, Yk)- We define a matrix A as follows: 


/xi \ 


/ ai,i 

^1,71 

yi 


02.1 

0-2,n 



Ofe-l.l 

^k — 1, 

W/ 


\ Ofc.l 

^k,n 


( 12 ) 


Let k be the power of two: 2 L*°S 2 = k. If fe < log 2 n, then we can state that 

no more than 2^ and no less than log 2 k pairwise distinct columns exist in the 
matrix A. Without the loss of generality we can assume that all d <2^ pairwise 
distinct columns are the first ones. Then for every j-th column, j > d, there 
is equal to it an i-th column, i < d. If we conjugate g^^^ by the permutation, 
corresponding to the gate Ci-j, we will zero out a j-th column in the matrix A. 
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We do this for all the columns whose index is greater than d using Li = 2{n — d) 
CNOT gates. In result we obtain a new permutation and a new matrix Ai 
for it as follows: 


Ai = 


02.1 

n 

ai,d 0 • 

02.d 0 • 

—d 

^ \ 
• • 0 


Ofc-1,1 

1 

0 

• • 0 


V ak,i 

^k,d 0 

• • 0 ; 


= 1 we conjugate g[^^ by the permutation corresponding 


to the gate W in order to zero out the first row of the matrix Ai. We need 


L 2 = 2d NOT gates to do this. In result we obtain a new permutation g 2 
a new matrix A 2 for it as follows: 


(K) 


and 


\ 


/ 

Next step is reducing the matrix A 2 to a canonical form, where every row, 
after reversing the order of its elements, will represent itself the binary expansion 
of row index minus 1 . 

All the rows in the matrix A 2 are distinct. The first row is already in the 
canonical form, so we will successively transform the rest of the rows beginning 
from the second one. Let’s assume that the current row has an index i and all 
the rows with indices from 1 to (i — 1) are in the canonical form. There are two 
cases: 

1. There is nonzero element bij in the i-th row with an index j > log 2 k. In 
this case for every element bi^i, j' 7 ^ j, j' < d, which is not equal to the 
j'-th bit in the binary expansion of the number (i — 1 ), we conjugate g^^^ 
by the permutation corresponding to the gate Cj-^/. This will require no 
more than 2d CNOT gates. To make the current row canonical, we should 
now zero out only the j-th element of it. It can be done with conjugating 
52 ^^ by the permutation corresponding to the gate Cjj, where I is the set 
of nonzero bits indices in the binary expansion of the number (i — 1). For 
example, if i = 6 , then J = { 1, 3 }. Since |/| < log 2 fc, we can replace this 
multiple controls Toffoli gate by a composition of no more than 8 log 2 k 
Toffoli gates [8], thus we need no more than 16 log 2 k Toffoli gates for this 
part. 

So, summing up, in this case we need < 2c? + 16 log 2 k gates from 17^ 
to transform the i-th row to the canonical form. 

2. There is no nonzero element in the Tth row with an index j > log 2 fc: 
bij = 0 for all j > log 2 k. In this case, we conjugate g^^^ by the permu¬ 
tation corresponding to the gate C/;iog 2 fc+i, where I is the set of current 


/ 0 

^ 2,1 


A 2 = 


bk-1,1 

V ^k,l 


n — d 


0 

b2,d 


bk-i,d 0 
bk,d 0 
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row nonzero elements’ indices. Because of inequality of matrix rows and 
because all the previous rows are in the canonical form, we can state that 
the value of fcj.iogj fc+i will be inverted only if j > i. Since |/| < log 2 k, 
we can replace this multiple controls Toffoli gate by a composition of no 
more than 81og2 k Toffoli gates [ 8 ], thus we need no more than 161og2 k 
Toffoli gates for this part. After that we can go to the previous case. 

So, summing up, in this case we need <2d + 32 log 2 k gates from 
to transform the i-th row to the canonical form. 


As we can see, we obtained a new restriction to the value of k: log 2 k should 
be strictly less than n, otherwise we will not be able to transform the matrix A 2 
to the canonical form. After this transforming, we obtain a new permutation 
and a new matrix A 3 for it as follows: 


log2 fc 


n—log2 k 


A3 = 


/ 0 0 0 ••• 0 0 ••• 0 \ 

1 0 0 ••• 0 0 ••• 0 


0 1 1 
V 1 1 1 


1 0 
1 0 


0 

0 


For this transformation we need L 3 gates from 


k 

L 3 = ^ < k{2d + 32 log 2 k) . 

i=2 


Finally, for every i > log 2 k we conjugate by the permutation corre¬ 
sponding to the gate Ni. We need L 4 = 2(n — log 2 k) NOT gates to do this. In 
result we obtain a new permutation g^'^ and a new matrix A 4 for it as follows: 


A4 


log2 k n—log2 k 

/ b 0 0"^ b 1 \ 

1 0 0 ••• 0 1 ••• 1 


0 1 1 ••• 1 1 ••• 1 

\ 1 1 1 ••• 1 1 ••• 1 / 


The permutation g^^^ corresponds to the single gate .log^k+i-.i- 

This gate has (n — log 2 fc) control inputs, thus it can be replaced by no more 
than L 5 < 8 (n — log 2 fc) Toffoli gates [ 8 ]. 

We obtained the permutation g^^^ with the help of conjugation the permu¬ 
tation g^^^ by specific permutations. If we conjugate g^^^ by exactly the same 
permutations, but in a reverse order, we will obtain g^^K In terms of a circuit 
synthesis this means that we should attach all the gates we used in our matrix 
transformations to the gate Cn^n-i, -- ,iog 2 k+i;i from left and right, but in a re¬ 
verse order. As a result we will obtain a reversible circuit &k, which defines 


12 










the permutation From this it follows that < L{<5k) and 


5 

Li < 2 {n — d) + 2 d+ 

i=l 


+ k{2d + 32 log2 k) + 2{n 


log2 k) + 8 {n 


log2 k) , 


L{g^^\ 0) < 12n + k2^'^^ + 32fc log2 fc — 10 log2 k . 

Also, L{g^‘^\0) < 12n + 364. 

Using these upper bounds in the equation (11), we obtain the following upper 
bound for the function L{fh,0): 

ora+l 

L{fh, 0) < —;—(12n + A:2^+^ + 32/clog2 k — 101og2 k) + fc(12n + 364) . 
k 

Our synthesis algorithm requires k to be the power of two and log2 k to be 
strictly less than n. Let m = log2 n — log2 log2 n — log2 </)(«) and k = 2L*°S2 H ^ 
where (j}{n) < n j log2 n is an arbitrarily slowly growing function. Then m/2 < 
k < m and 

2^+2 

L{fh, 0) < -(12n + 2m2"‘ + (32 — o(l))mlog2 m) , 

m 

log2 n ■ log2 log2 n 
n 



Hfh,o) < 


3n2"+4 


m 


1 + 


2"*log2n /8 


6n 


+ 15-0(1) 


From this we obtain the final upper bound for the function L{fh, 0): 

3n2”+4 


Lifh.O) < - - 1 'i 

log2 n - log2 log2 n - log2 (/(n) 

where the function e{n) equals to 


(l + e(n)) , 


(13) 


^o(l) 

Since our synthesis algorithm can produce a reversible circuit © for an ar¬ 
bitrary permutation h £ it follows that the function L(n,0) is upper 

bounded by the same value as L{fh,0). □ 


log2 n ■ log2 log2 n 


e(n) = 


6(/)(n) 


-f 


To explain the main part of our synthesis algorithm, let’s consider a permu¬ 
tation = ((1, 0,0,1), (0, 0,0, 0)) o ((1,1,1,1), (0,1,1, 0)). This permutation 
can be implemented by a reversible circuit © = Ci;4*C2;3 *Ni*N 3 *N 4 ** 
N 4 * N 3 * Ni * C'2;3 * Ci- 4 . The process of obtaining the circuit © is showed in 

Fig. 3. 

The proposed synthesis algorithm allows us to prove the Theorem 5. 


Proof of the Theorem 5. We should prove the following equation: 

n2"+5 


D{n, 0) < 


log2 n - log2 log2 n - log2 (/(n) 


(1-f e(n)) 


where (f>{n) <n j log2 n is an arbitrarily slowly growing function and e(n) equals 
to 

A(p(n) n 
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Figure 3: The process of obtaining a reversible circuit ©, implementing a 
permutation = ({1,0, 0,1), (0,0, 0,0)) o ((1,1,1,1), (0,1,1, 0)). 


This can be easily done, if we take into account that some of operations in 
the proposed synthesis algorithm can be done with the logarithmic depth. For 
example, we can zero out duplicating columns in the matrix with the logarithmic 
depth (see Fig. 4). Also, a conjugation by permutations, corresponding to NOT 
gates, can be done with the constant depth. 


'T* 'T* 'T* 'T* 'T* 'T* 'T* 'T* 

•jU JU JU JU JU JU JU JU 



5 < 

r 

K 

5 < 

K 

5 < 

r 



K 

J 

r 
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k 

J 


xOOOOOOO 


Figure 4: Clearing duplicating inputs with the logarithmic depth. 

This implies that Di = 2 |’log 2 (n — d)] (against Li = 2(n — d), see page 11), 
D 2 = 2 (against L 2 = 2 d, see page 11 ) and D 4 = 2 (against L 4 = 2 (n — log 2 k), 
see page 12). All other parts of our synthesis algorithm produce sub-circuits 
with the depth equal to the gate complexity: O 3 = L 3 < k( 2 d -I- 321og2fc), 
D 5 = < 8 {n — log 2 k) (see page 12 ). 

Using these depth values, we can derive the following upper bound: 

5 

Di <2 log 2 n + ki2^'^^ + 32 log 2 k) + 8 (u — log 2 fc) -|- 6 . 

i=l 

Also, 0 (g( 2 ), 0) < 8 n -k 2 log 2 n + 374. 

Using these upper bounds in the equation (11) for the circuit depth, we 
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obtain the following upper bound for the function D{fh,0): 


nra+l 

D{fh,0) < —^(8n + 21og2 n + + (32 - o(l))A:log 2 k) . 

r\j 


Providing m = log 2 n — log 2 log 2 n — log 2 (()(n) and k = where 

4 >{n) < n I log 2 n is an arbitrarily slowly growing function, we obtain an upper 
bound for the function D{fh,0)'. 




D{fh,0) < 


4n 4n 
n2”+5 


log 2 n - log 2 log 2 n - log 2 (j)(n) 


(1 + e(n)) 


where the function e(n) equals to 


i) = 


4(^(n) 


+ (4-0(1)) 


log 2 n ■ log 2 log 2 n 


Since our synthesis algorithm can produce a reversible circuit © for an ar¬ 
bitrary permutation h € ^(Z^), it follows that the function D(n,0) is upper 
bounded by the same value as D{fh, 0). □ 


Now we should prove the last theorem of this paper. 


Proof of the Theorem 6. We should count the number of NOT, CNOT and 2- 
CNOT gates in a reversible circuit, synthesized by the proposed synthesis algo¬ 
rithm, to prove an upper bound of the theorem 


W{n,0) ^ 


n2"+4 (W(C) (1 + ec{n)) + 2W(t) (1 (n)) 


log 2 n - log 2 log 2 n - log 2 4>in) 
where (j){n) < n / log 2 n is an arbitrarily slowly growing function and 


ec{n) = 


1 






log 2 log 2 n 


€T(n) = (4-o(l)) 
We can see that 


log 2 n ■ log 2 log 2 n 


Lp) = 2{n - d) (CNOT gates only) , 
= 2d (NOT gates only) , 

< 2kd (CNOT gates only) , 

^ 4 ^^ = 2(n — log 2 k) (NOT gates only) 


(T) 

n 

(T) 


= 0 ; 
= 0 ; 


< 32/c log2 k ; 


L 


lP = 0 


(T) 

4 

(T) 


= 0 ; 

< 8(n - log2 k) . 
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Summing up, we obtain the following upper bounds: 


5 

L('=)(9(« 0) < jjfi) £ 2 {ji — d) -\-2d-\- 2kd + 2(n — log 2 k) , 

i=l 


2,(T)(^(if)^ 0) < ^ < 32fclog2 k + 8{n- log 2 k) , 

i=l 

( 5 ^^), 0) < 4n + fc2'=+i - 2 log 2 k , 

2 ^(t) (g(^), 0) < 8n + 32fc log 2 k — 8 log 2 k . 


Also, 0) < 4n + 124 and 0) < 8n + 240. 

Using the equation (11), we obtain the following upper bounds for the func¬ 
tions and L^^'>{fh,0): 

on-l-l 

L^^Hfh,0) < ^(4n + A:2'=+i -21og2fc) + fc(4n + 124) , 
k 

on+l 

L^^Ufh, 0) < —;—(8n + 32fclog2 k — 81og2 k) + k(8n + 240) . 
k 

Providing m = log 2 n—log 2 log 2 n—log 2 <('(«-) and k = 2^^°^^ '"J, where 4>{n) < 
n / log 2 n is an arbitrarily slowly growing function, we obtain the following upper 
bounds for the functions and {fh,0): 


L^^\h, 0 )< 


n2’"+4 

log 2 n - log 2 log 2 n - log 2 (j){n) 
n2"+5 

log 2 n - log 2 log 2 n - log 2 (/>(n) 


(1 + ec(n)) 
(l + eT(n)) 


where the functions ec{n) and eT{n) equal to 


ec{n) 

erin) 


1 

2 (j){n) 



l0g2 l0g2 ^ 

n 


(4 _ j(-^)) ^°g2^-^0g2l0g2^ 
n 


Since our synthesis algorithm can produce a reversible circuit 6 for an ar¬ 
bitrary permutation h G ^(Z^), it follows that the function tU(n,0) is upper 
bounded by the same value as W{fh, 0). From upper bounds for the functions 
L^^'>{fh,0) and and from the equation (6), an upper bound for the 

function lU(n,0) from the Theorem 6 follows. □ 


From the proof it follows that the ratio of the numbers of gates NOT, CNOT 
and Toffoli in a synthesized circuit is approximately equal to 1:1:4. 


6 Conclusion 

We have discussed the problem of general synthesis of a reversible circuit with¬ 
out additional inputs, consisting of NOT, CNOT and 2-CNOT gates, with 
the lowest possible gate complexity and depth. We have studied the Shannon 
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gate complexity function L{n,q), the depth function D(n,q) and the quantum 
weight function W(n, q) for a reversible circuit, implementing a transformation 
/: ^2 —^2 from the set F{n, q) without additional inputs. 

From the lower bounds of these function we can see that using additional 
inputs should reduce the circuit’s gate complexity and the depth. This is in 
line with respective practical evaluations as e.g. conducted in [18, 19]. Also, 
in paper [20] an upper asymptotic bound 2" for the function Lin, q) in case 
of using additional inputs was established. This bound is asymptotically lower 
than our bound for L(n,0), but a significant number of additional inputs in a 
reversible circuit is required to achieve it. 

When solving the problem of reversible logic synthesis one should find a 
compromise between the gate complexity, the depth (working time) and the 
amount of used memory (additional inputs) of a reversible circuit. Further 
research should establish a more precise relationship of these parameters from 
each other. 
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